A quantum circuit for OR 
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We give the first quantum circuit for computing /(0) OR 
/(f) more reliably than is classically possible with a single 
evaluation of the function. OR therefore joins XOR (i.e. par- 
ity, /(0)©/(l)) to give the full set of logical connectives (up to 
relabeling of inputs and outputs) for which there is quantum 
speedup. The XOR algorithm is of fundamental importance 
in quantum computation; our OR algorithm (found with the 
aid of genetic programming), may represent a new quantum 
computational effect, also useful as a "subroutine". 



All digital computing can be built out of Boolean func- 
tions. Suppose a physical process takes the two orthog- 
onal inputs |0), |1) to outputs |/(0)), |/(1)), so that / is 
just a (classical) Boolean function from {0, 1} to {0, 1}. 
This could be a quantum-coherent computer subroutine, 
or the evolution of some other physical system we are 
interested in jlj . If the process is a unitary quantum evo- 
lution, and we can prepare a desired state as input, then 
quantum computation lets one find out more about the 
function / than if we are restricted to evolution of clas- 
sical states. For instance, we can find out its parity /(0) 
XOR /(l). The XOR circuit in § was the first concrete 
demonstration of quantum computation's greater-than- 
classical power; the exact version in [|| is of fundamental 
importance in its own right and for its applications in 
more complex algorithms. 

In this Letter, we complete the demonstration of quan- 
tum computation's greater-than-classical power in this 
simple setting, by describing circuits which compute /(0) 
OR /(l) with one call to the subroutine /. While these 
circuits, unlike the XOR circuit, may err, we show that 
their performance is better than any possible classical 
circuit. This and XOR are the only quantum speedups 
in this simple domain. (NOR, AND, NAND, and NOT- 
XOR may also be sped up but they, and the algorithms 
that speed them up, are isomorphic to OR or XOR by 
simply relabeling of inputs and/or outputs.) 

The OR algorithm represents a new quantum compu- 
tational effect, which may have applications to more com- 
plex problems. Indeed, our circuits were derived as sub- 
routines of a better-than-classical routine (evolved using 
genetic programming) to compute a more complex prop- 



erty, "AND/OR," of Boolean black-box functions of two 
variables (/ : {0, l} 2 -> {0, 1}). OR and AND/OR form 
part of an infinite family of properties (uniform binary 
AND/OR trees) whose quantum complexity is still im- 
perfectly understood, but which have great theoretical 
and practical importance in computing, since they have 
applications in game tree evaluation, logic programming, 
theorem-proving, and many other areas, and their clas- 
sical query complexity is completely understood. 

The quantum complexity of Boolean properties of 
black-box functions has been studied in Here, we 

examine quantum gate arrays for certain properties of 
black box functions of one and two qubits. Given an 
unknown function / which may be called on particular 
inputs or coherently on superpositions of them, we wish 
to evaluate a Boolean property P of /. We are inter- 
ested in Ptoox' t ne maximum over functions of the prob- 
ability that an algorithm evaluates P(f) incorrectly, and 
Qmax, the maximum over functions of the expected num- 
ber of function queries. Exact algorithms have p e max = 0; 
Las Vegas algorithms are correct whenever they answer 
or 1, but may also answer "don't know" with p < 1/2. 
Monte Carlo algorithms may err, but p^ al < 1/2. The 
error is one-sided if there is a value x (0 or 1) such that 
p e = for / such that P(f) = x; otherwise it is two- 
sided. By n repetitions (and majority voting, for Monte 
Carlo), the latter two types may be made to have expo- 
nentially small (in n) probability of not giving the correct 
answer. Below, we use a definition in which Las Vegas 
algorithms may have stochastic runtime, but give correct 
answer with p = 1; DFP (described below) is an exam- 
ple. By running it repeatedly until an answer is obtained, 
the first type of Las Vegas algorithm may be converted 
into one of the second type with expected running time 
greater by a constant factor. 

Genetic programming (GP) [Q evolves a population 
of programs (in our case, sequences of quantum gates) 
which are randomly mutated, recombined with each 
other, and preferentially selected for desired properties 
by running (or simulating) the programs on a sample of 
inputs. The repertoire of gates used by our GP engines 
(square brackets indicate qubit references, parentheses 
real parameters) was: HADAMARD[q], U-THETA[q] (9):= 
cos 01 + ismOa,,, U2fql(a, 6. 6, ib) := e^e - ^ 7 * (cos 91 - 
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y , U2[q](a,0,0,V>) := 

CN0T[control, target]; 



CPHASE[control, target] (a) , which multiplies each 
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standard basis state by e ta if it has 1 in both control 
and target positions; ORACLE [qi, ...q n , q out ], which adds 
(mod 2) f(q 1 , ...,q n ) to q out , retaining q u ...q n unchanged; 
and MEASURE- [q] and MEASURE- 1 [q] , which measure 
qubit q in the standard basis. The MEASURE-x gates ter- 
minate the computation, returning the value x, if the 
measurement result x is obtained; if the result x is ob- 
tained, the state is projected onto the subspace with \x) 
for that qubit, and computation proceeds. 

Allowing termination conditional on intermediate re- 
sults, as MEASURE gates do, makes the number of queries 
stochastic. For Monte Carlo algorithms, this yields at 
best a constant speedup over algorithms with a definite 
number of queries. Nevertheless, it may yield more per- 
spicuous algorithms, and the constant speedup may be 
needed for better-than-classical performance, especially 
in the small-n regime. 

For functions of binary strings of length d, the 
AND/OR problem is to evaluate a binary tree, having 
AND at the root and d layers of alternating OR and AND 
as nodes, with a d+lst layer of n = 2 d leaves consisting of 
the values of the black-box function ordered by their in- 
put string (viewed as a binary integer). This and related 
problems have many applications, for example in game- 
tree evaluation, dynamic programming, waveform anal- 
ysis, and theorem-proving. Saks and Wigderson showed 
that "depth-first pruning" (DFP) is the best classical Las 
Vegas algorithm for AND/OR ||. DFP uses a routine 
eval (node) which returns the value of the node if it is a 
leaf, and otherwise randomly chooses a daughter of the 
node and calls itself on the subtree rooted at that daugh- 
ter. If this call returns a value for the subtree that de- 
termines the value of the node (1 if it is an OR node, if 
it is an AND node), eval returns the appropriate value; 
otherwise it calls itself on the other highest-level subtree 
of the node, and returns the value of that subtree. DFP 
itself just calls eval (root). 

Santha || showed, for read-once Boolean functions (for 
which there is a Boolean formula containing each variable 
at most once), that no classical Monte Carlo algorithm 
with all error probabilities below p can have expected 
queries q < (1 — xp)Q, where Q is the time taken by the 
optimal Las Vegas algorithm, and x = 1, 2 as the error is 
one- or two-sided. (It is not known whether a quantum 
analogue of this holds.) This is just the trivial speedup 
obtained by flipping a biased coin to decide whether to 
do the optimal Las Vegas algorithm or output a random 
bit (two-sided) or a zero (one-sided). Thus a q-query 
quantum algorithm would have to have p^ax < j(l — q) 
to be better-than-classical. DFP has worst-case expected 
queries 3 for depth- two AND/OR, so a one-query quan- 
tum algorithm would need p < 1/3 two-sided, p < 2/3 
one-sided to do better than classical. There is no one- 
query, zero-error quantum strategy for calculating OR 
for a black box Boolean function of one bit [jR^D . DFP 
has expected queries 3/2, so a one-query quantum algo- 



rithm with p < 1/6 two-sided or p < 1/3 one-sided would 
be better than classical. 

Our OR circuit is shown in Fig. 2. We use the conven- 
tion that the leftmost qubit in a ket or string of kets 
is qubit 0, the next qubit 1, and so on. Before the 
MEASURE- 1 gate on qubit 0, the state is: 

(l/2)[|0)(|/ ) + |/i)) + |l)(|/o)-|/i))]. (1) 

Thus the MEASURE- 1 has p = for outcome 1 if f = fi 
(even parity), due to destructive interference, while if the 
parity is odd, it has p = 1/2 of correctly yielding 1 (|/ ) 
and are orthogonal, and do not interfere). If the 
computation does not halt, the state becomes: 

(l/2)|0)(|/o) + |/x)). (2) 

Its squared norm is the probability that the MEASURE- 1 
gate yielded 0. First consider 9 = 0. For the even parity 
cases, this term gives the correct answer, while for the 
odd parity cases, it is equally likely to give either an- 
swer; it contributes 1/4 to p e . Thus p®° = pi 1 = 0, and 
Pe 1 = Pe° = 1/4- The error is one-sided, so it is better 
than classical (p™ a:E < 1/3). (If we had not halted the 
computation when the measurement of qubit yielded 
0, and had measured qubit 1 in the eigenbasis of a x in- 
stead of the o z -eigenbasis used in the 9 = version of 
our algorithm, one sees from (]]]) that a value of for 
the final measurement means the value of the first mea- 
surement gives the parity, while a value 1 for the final 
measurement means the value of the first measurement 
is noise. This is Deutsch's Las Vegas algorithm for par- 
ity [||.) Our algorithm also outperforms attempts to use 
2-alternative Grover search to evaluate OR; despite that 
method being asymptotically optimal for OR of many in- 
puts, it does not perform better than classically in this 
instance. Adding an X (9) before the final measurement 
gives: p° e dd = 1/2+ (^) 2 , p% ven = s 2 . Equating these 

gives a solution c = -^2=, s = —}= with p. = 0.1 for all 

° VTo vio ^ 

cases. Since p e < 1/6 two-sided with one query, this is 
also better than classical. 

To see that the 9 = array for OR minimizes p e max 
subject to the constraint pf m = p e xl = 0, consider the 
state just before the black-box function is queried: 

|*) = |^oo)|00) + KM|01) + |^io>|10> + |^ii)|H) . (3) 

The right-hand ket in each term is a state of the 
two qubits on which the function will be called, and 
Y^ij^iAi'ij) = 1- After the query, the state is: 

|Voo)|o/(o)) + |Vta>|o7(o)> + liM|i/(i)> + IV^ii)|i7(T)) ■ 

The four functions of one bit give states |0), |1), |2), |3). 
We have: 

(0|1) = (^ool^oo) + (^oil^oi) + 2Re(^io|^u) ■ (4) 
(0|2) = + (V>icMo) + 2Re(<Aool^oi) ■ (5) 

(0|3) = 2ReW>oo|V'oi> + 2Re(^io|V>n) • (6) 
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Requiring p\ x = and thus (0|3) = 0, there are optimal 
algorithms which measure |0)(0| and I— |0)(0|. For these, 
p-max _ max ^. =12 3 |(0|j)| 2 . A simple calculation using 
the Schwarz inequality and X^jiV'ijlV'ij) = 1 shows that 
P e max is minimized where (0|2) = (0|3) = 1/2, at p e max = 

For our algorithm (0|3) = (1|2) = 0, while the other 
inner products are 1/2. The states span a 3-d real sub- 
space of the 4-d complex space of two qubits. They lie, 
evenly spaced, on a cone with apex at the origin and 
opening angle 7r/2. The rest of the algorithm measures 
three orthogonal subspaces: {|10),|11)} (outcome 1 for 
MEASURE- 1 ; algorithm returns 1), { |00) } (outcome for 
final measurement on qubit 1; algorithm returns 0), and 
{|01)} (outcome 1 for final measurement on qubit 1; algo- 
rithm returns 1). The outcome 1 for the algorithm corre- 
sponds to the 3-d subspace perpendicular to 1 00) ) , while 
the outcome corresponds to {|00)}. When MEASURE- 1 
yields 1, qubit 1 lies along |0) — |1), so within the 3-d 
space spanned by the possible computer states the out- 
come "1" for MEASURE- 1 involves one dimension, and 
the result "1" for the algorithm two dimensions. Our al- 
gorithm makes a further finegrained measurement within 
these two dimensions, but we may avoid this (which could 
affect the results when the routine is called recursively) 
by converting it into one which measures a single qubit 
at the end. Such an algorithm is: HADAMARD ; ORACLE 
1 ; HADAMARD ; CHADAMARD 1 ; CONTROLLED 
X-THETA OH; MEASURE-0 1 ; MEASURE- 1 1. 

The OR algorithm was derived by restriction from a 
larger algorithm found via genetic programming. This 
algorithm computes the depth- two case (AND/OR2) of 
evaluating: 

(/(00)V/(01))A(/(10)V/(11)). 

With the number of oracle calls fixed at one, and selection 
to minimize p e max 1 GP yielded the algorithm reported in 
p| , where more detail on the GP engine used to evolve 
it may also be found. 

Hand simplification and improvement yielded the gate 
array of Fig 1. Here X(6*) has the matrix (c :— cos 8, s :— 
sine, 9 = .0749...): 

c s 

s —c 

This algorithm has error probabilities constant on or- 
bits of the automorphism group, given in Table I. 



Orbit 


Pe 


Orbit 


Pe 



1 
11 


.00560 
.28731 
.21269 


10 1 
110 1 
1111 


.28731 
.21269 
.00560 



Table I: Error probabilities (to 5 digits) for hand-tuned 
simplified AND/OR algorithm 



The automorphism group of a property P consists 
of those permutations a of its input variables which 
leave its value invariant for all assignments (all black- 
box functions). AND/OR2 has four input variables 
fo = /(00), /1 ee /(01), / 2 , / 3 ; its automorphism group is 
generated by (0 ^ 1), (2 ^ 3), (0 2, 1 <-> 3). Aut(P) 
acts on functions via f a (x) := f(a(x)) For AND/OR2, 
the orbits of this action may be labeled by representa- 
tive functions (written as strings /0/1/2/3): 0000i, OOOI4, 
OOII2, OIOI4, IIOI4, lllli. Subscripts indicate the num- 
ber of functions in the orbit. Our algorithm also has this 
automorphism group: the outcome probabilities for all its 
measurement gates are constant on orbits of the group. 
It is better than classical, since p™ ax < 1/3. 

The structure of the algorithm suggests examining its 
restriction to qubit 1, since the states with versus 1 
input in qubit are still orthogonal when qubit 1 is mea- 
sured, so don't interfere. If we fix the input for qubit 
at and remove qubit from the algorithm we may con- 
sider the algorithm to use only two qubits and to apply 
to a new function / :— /(0-) defined by f(0) = /(00) and 
/(I) = /(01)- Relabeling qubit 1 as 0, qubit 2 as 1 and 
/ as /, we get the circuit of Fig. 2 for computing /(0) 
OR/(l). 

Similarly, fix a value x for qubit 1 in the depth-two 
algorithm and view it as an algorithm for AND operat- 
ing on the one-bit function f x {y) given by f(xy). This is 
not identical to the array derived by applying De Mor- 
gan's law (N0T-(N0T-A OR N0T-B) ee A AND B) to our 
OR algorithm, but its action on all black-box functions 
is the same. So the depth-two algorithm can be loosely 
viewed as a "recursive" application of the depth-one al- 
gorithm (modified to give AND at the top level). Loosely 
speaking, it superposes values for qubit (as in the AND 
algorithm) , and calls a "function" of qubit 1 ; which func- 
tion depends on qubit 0. This is not quite accurate for 
two reasons. First, the lower-level algorithm returns a 
value to the upper level only if the MEASURE- 1 1 does 
not halt the computation. If MEASURE- 1 1 halts the com- 
putation, the superposition of qubit-0 values provides a 
random value for that bit; the lower level algorithm is 
effectively called on a randomly chosen marginal black- 
box function, and the result returned as our algorithm's 
final output. Second, if the lower-level "function call" 
is not halted the "function" of i called by the top-level 
AND routine has an /(i-)-dependent 4x4 unitary matrix 
which is not of the usual black-box form. The state after 
the function call and second HADAMARD 1 is: 

(1/2V2) [ |00)(|/ 00 ) + I/01)) + |01)(|/oo) - I/01)) 

-|10)(|/io) + I/11)) -|11>(|/ 10 >- I/11))] • (7) 

This is a superposition, with coefficients 1 / \/2 and 
— I/a/2, of states of the form ([!]) of qubits 1 and 2, with 
the qubit recording which marginal black box function, 
/(0-) or /(!•), is involved. 
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When /(OO) ^ /(01) or /(10) ^ /(ll), but not both; 
(orbits IIOI4 and OOOI4), the MEASURE- 1 has p = 1/4 of 
halting the computation with result 1 , an error if the or- 
bit is 0001. If both subtrees have odd parity (OIOI4 only), 
p = 1/2 of halting and (correctly) yielding 1, and if nei- 
ther does (0000, 1111, 0011 2 ), p = 0. This measurement 
contributes 1/4 to pg oi- 

The state after the MEASURE- 1 gate yields is given by 
the terms with |00) and 1 01) in (Q). Hadamarding qubit 
yields: 



(1/4) [ |00)(|/oo> + |/oi) -I/10) 

+ |10)(|/ 00 ) + I/01) + I/10) + I/11)) 



(8) 



The MEASURE-0 gate will terminate the computation with 
the result with probability given by the squared norm of 
the first term in this state. If measurement yields 1, the 
final state is given by the second term in (|§|). Then the 
probability of the final measurement on qubit 2 giving 
1(0) is 



(l/16)K (0) (.f)) 2 , 



(9) 



where n x (/) is the number of inputs on which / takes the 
value x. The total error probabilities (with their sources) 
are: 



Orbit 


Error probability 








1 


1/4 (M-l 0) + 1/16 (M-l 2) = 5/16 


11 


1/4 (M-l 2) 1/4 


110 1 


1/8 (M-0 1) + 1/16 (M-0 2) = 3/16 


10 1 


1/4 (M-0 2) 1/4 


1111 






'3s- 



' c+s \ 
, 2 I 



qubit 2. We obtain p ooi = ) +3> P° 101 
Equating these gives s 2 — (9 — 14-\/2/5)/26, so 9* = 
0.074909... at p e = p 000 i = poioi = 0.287315.... 

With the help of genetic programming, we found 
better-than-classical quantum gate arrays for the depth 
one and two cases of the family of properties of black-box 
functions given by alternating binary AND/OR trees. 
These circuits could constitute small-n instances of a 
scalable Monte Carlo algorithm for this family of proper- 
ties. They are also small-n instances of the bounded- 
depth AND/OR tree problems whose complexity was 
characterized in |^],[|. The Grover-based algorithms 
therein achieve marked speedups over classical means 
in the large-n regime, as the fan-in of the nodes in the 



bounded-depth trees grows; their n = 2 and n — 4 in- 
stances are not superclassical. In contrast, our arrays 
give speedups for fan- in 2. Since the uniform binary tree 
problem has fan-in 2 for all instances, this suggests that 
aspects of our gate arrays may prove useful in addressing 
this problem even for large n. If so, this would be an- 
other new quantum computational effect related to the 
OR circuit. As it stands, OR joins XOR to complete the 
set of logical properties of one-bit black-box subroutines 
which can be quantum-mechanically computed, with one 
subroutine call, more reliably than is classically possible. 
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FIG. 1. Hand-tuned version of evolved AND/OR; 
6 = 0.74909... 
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FIG. 2. An algorithm for OR 



